1 A scheduling algorithm for optimization and early planning in high-level synthesis
Seda Ogrenci Memik, Ryan Kastner, Elaheh Bozorgzadeh, Majid Sarrafzadeh 
January 2005 ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 10 Issue 1
Publisher: ACM Press 

Full text available: pdf(235.21 KB) Additional Information: full citation, abstract, references, index terms

Complexities of applications implemented on embedded and programmable systems grow 
with the advances in capacities and capabilities of these systems. Mapping applications 
onto them manually is becoming a very tedious task. This draws attention to using high- 
level synthesis within design flows. Meanwhile, it is essential to provide a flexible 
formulation of optimization objectives as well as to perform efficient planning for various 
design objectives early on in the design flow. In this work, we ... 

Keywords: Scheduling, bipartite matching, data flow graph, high-level synthesis 



2 Are wait-free algorithms fast? 
Hagit Attiya, Nancy Lynch, Nir Shavit 
July 1994 Journal of the ACM (JACM), Volume 41 Issue 4
Publisher: ACM Press

Full text available: pdf(2.58 MB) Additional Information: full citation, abstract, references, citings, index terms, review


The time complexity of wait-free algorithms in "normal" executions, where no failures 
occur and processes operate at approximately the same speed, is considered. A lower 
bound of log n on the time complexity of any wait-free algorithm that achieves 
approximate agreement among n processes is proved. In contrast, there exists a
non-wait-free algorithm that solves this problem in constant time. This implies an
Ω(log n ...



Keywords: approximate agreement, fault-tolerance, wait-free 



3 Automatic data layout for distributed-memory machines
Ken Kennedy, Ulrich Kremer
July 1998 ACM Transactions on Programming Languages and Systems (TOPLAS), Volume 20 Issue 4
Publisher: ACM Press

The goal of languages like Fortran D or High Performance Fortran (HPF) is to provide a
simple yet efficient machine-independent parallel programming model. After the algorithm 
selection, the data layout choice is the key intellectual challenge in writing an efficient 
program in such languages. The performance of a data layout depends on the target 
compilation system, the target machine, the problem size, and the number of available 
processors. This makes the choice of a good layout extremel ...

Keywords: high performance Fortran 



4 Fast deterministic consensus in a noisy environment
James Aspnes
July 2000 Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Publisher: ACM Press

Full text available: pdf(1.10 MB) Additional Information: full citation, abstract, references, citings, index terms

It is well known that the consensus problem cannot be solved deterministically in an 
asynchronous environment, but that randomized solutions are possible. We propose a 
new model, called noisy scheduling, in which an adversarial schedule is perturbed 
randomly, and show that in this model randomness in the environment can substitute for 
randomness in the algorithm. In particular, we show that a simplified, deterministic 
version of Chandra's wait-free shared-m ... 

5 Compiler scheduling: Convergent scheduling
Walter Lee, Diego Puppin, Shane Swenson, Saman Amarasinghe
November 2002 Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Publisher: IEEE Computer Society Press

Full text available: pdf(1.40 MB) Additional Information: full citation, abstract, references, citings, index terms

Convergent scheduling is a general framework for cluster assignment and instruction 
scheduling on spatial architectures. A convergent scheduler is composed of independent 
passes, each implementing a heuristic that addresses a particular problem or constraint. 
The passes share a simple, common interface that provides spatial and temporal 
preference for each instruction. Preferences are not absolute; instead, the interface allows 
a pass to express the confidence of its preferences, as well as pref ... 

6 Fast detection of communication patterns in distributed executions
Thomas Kunz, Michiel F. H. Seuren
November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research
Publisher: IBM Press

Full text available: pdf(4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

7 Communication scheduling
Peter Mattson, William J. Dally, Scott Rixner, Ujval J. Kapasi, John D. Owens
November 2000 ACM SIGARCH Computer Architecture News, ACM SIGOPS Operating Systems Review, Proceedings of the ninth international conference on Architectural support for programming languages and operating systems ASPLOS-IX, Volume 28, 34 Issue 5, 5

Full text available: pdf(149.45 KB)

The high arithmetic rates of media processing applications require architectures with tens
to hundreds of functional units, multiple register files, and explicit interconnect between 
functional units and register files. Communication scheduling enables scheduling to these 
emerging architectures, including those that use shared buses and register file ports. 
Scheduling to these shared interconnect architectures is difficult because it requires 
simultaneously allocating functional units to operati ... 

8 Communication scheduling
Peter Mattson, William J. Dally, Scott Rixner, Ujval J. Kapasi, John D. Owens
November 2000 ACM SIGPLAN Notices, Volume 35 Issue 11
Publisher: ACM Press

Full text available: pdf(1.12 MB) Additional Information: full citation, abstract, references, index terms

The high arithmetic rates of media processing applications require architectures with tens
to hundreds of functional units, multiple register files, and explicit interconnect between 
functional units and register files. Communication scheduling enables scheduling to these 
emerging architectures, including those that use shared buses and register file ports. 
Scheduling to these shared interconnect architectures is difficult because it requires 
simultaneously allocating functional units to operati ... 

9 Special issue: AI in engineering
D. Sriram, R. Joobbani
April 1985 ACM SIGART Bulletin, Issue 92
Publisher: ACM Press

Full text available: pdf(8.79 MB) Additional Information: full citation, abstract

The papers in this special issue were compiled from responses to the announcement in 
the July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The 
interest being shown in this area is reflected in the sixty papers received from over six 
countries. About half the papers were received over the computer network. 

10 Two-dimensional round-robin schedulers for packet switches with multiple input queues
Richard O. LaMaire, Dimitrios N. Serpanos
October 1994 IEEE/ACM Transactions on Networking (TON), Volume 2 Issue 5
Publisher: IEEE Press

Full text available: pdf(1.24 MB) Additional Information: full citation, references, citings, index terms, review



11 Advances in embedded software scheduling techniques: A low power scheduler using game theory
N. Ranganathan, Ashok K. Murugavel
October 2003 Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Publisher: ACM Press

Full text available: pdf(180.91 KB) Additional Information: full citation, abstract, references, index terms

In this paper, we describe a new methodology based on game theory for minimizing the 
average power of a circuit during scheduling in behavioral synthesis. The problem of 
scheduling in data-path synthesis is formulated as an auction based non-cooperative finite 
game, for which solutions are developed based on the Nash equilibrium function. Each 
operation in the data-path is modeled as a player bidding for executing an operation in 
the given control cycle, with the estimated power consumption as t ... 

Keywords: auction theory, game theory, high-level synthesis, low power design

12 Technical reports
SIGACT News Staff
January 1980 ACM SIGACT News, Volume 12 Issue 1
Publisher: ACM Press

Full text available: pdf(5.28 MB) Additional Information: full citation

rostering: runway schedule determination by simulation optimization
Thomas Curtis Holden, Frederick Wieland 

December 2003 Proceedings of the 35th conference on Winter simulation: driving innovation
Publisher: Winter Simulation Conference

Full text available: pdf(446.86 KB) Additional Information: full citation, abstract, references

Many airport runway expansion projects are restricted by space limitations imposed by 
development in the vicinity of the airport. This often causes planners to choose 
configurations for new runways that limit the use of these runways in time and/or space. 
Studies that model airports with new runways that are not yet operational need to 
develop plausible operational models for these new runways since historical data is not 
available. We look at a runway schedule problem encountered during the ... 

14 University of Maryland student scheduling algorithm
James Stewart, Robert L. Clark
January 1968 Proceedings of the 1968 23rd ACM national conference
Publisher: ACM Press

Full text available: pdf(750.84 KB) Additional Information: full citation, abstract, references, index terms

This report describes the basic algorithm by which students are assigned to classes, the 
various criteria which directed the design of that algorithm, and the consequences of 
some of its features. The University of Maryland Student Scheduling Program consists of 
two separate phases: a catalogue builder and a student scheduler. These programs 
operate on the IBM 7094 and are described in the last section of this report. Since the 
subject of this report is the basic scheduling algor ... 

15 A SMART scheduler for multimedia applications
Jason Nieh, Monica S. Lam
May 2003 ACM Transactions on Computer Systems (TOCS), Volume 21 Issue 2
Publisher: ACM Press

Full text available: pdf(570.87 KB) Additional Information: full citation, abstract, references, index terms

Real-time applications such as multimedia audio and video are increasingly populating the 
workstation desktop. To support the execution of these applications in conjunction with 
traditional non-real-time applications, we have created SMART, a Scheduler for 
Multimedia And Real-Time applications. SMART supports applications with time 
constraints, and provides dynamic feedback to applications to allow them to adapt to the 
current load. In addition, the support for real-time applications is integrat ... 

Keywords: Scheduling, multimedia, proportional sharing, real-time 

16 A formal study of distributed meeting scheduling: preliminary results
Sandip Sen, Edmund H. Durfee
October 1991 ACM SIGOIS Bulletin, Conference proceedings on Organizational computing systems COCS '91, Volume 12 Issue 2-3
Publisher: ACM Press

17 Space-time scheduling of instruction-level parallelism on a raw machine
Walter Lee, Rajeev Barua, Matthew Frank, Devabhaktuni Srikrishna, Jonathan Babb, Vivek 
Sarkar, Saman Amarasinghe 

October 1998 ACM SIGPLAN Notices, ACM SIGOPS Operating Systems Review, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems ASPLOS-VIII, Volume 33, 32 Issue 11, 5
Publisher: ACM Press

Full text available: pdf(1.79 MB) Additional Information: full citation, abstract, references, citings, index terms

Increasing demand for both greater parallelism and faster clocks dictate that future 
generation architectures will need to decentralize their resources and eliminate primitives 
that require single cycle global communication. A Raw microprocessor distributes all of its 
resources, including instruction streams, register files, memory ports, and ALUs, over a 
pipelined two-dimensional mesh interconnect, and exposes them fully to the compiler. 
Because communication in Raw machines is distributed, com ... 

18 A continuum of disk scheduling algorithms
Robert Geist, Stephen Daniel
January 1987 ACM Transactions on Computer Systems (TOCS), Volume 5 Issue 1
Publisher: ACM Press

Full text available: pdf(866.07 KB) Additional Information: full citation, abstract, references, citings, index terms, review

A continuum of disk scheduling algorithms, V(R), having endpoints V(0) = SSTF and V(1)
= SCAN, is defined. V(R) maintains a current SCAN direction (in or out) and services next 
the request with the smallest effective distance. The effective distance of a request that 
lies in the current direction is its physical distance (in cylinders) from the read/write head. 
The effective distance of a request in the opposite direction is its physical di ... 
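
The selection rule described in this abstract can be sketched as follows. The abstract is cut off before it states how a request in the opposite direction is penalized, so the sketch assumes the effective distance there is the physical distance plus R times the total number of cylinders; that assumption is at least consistent with the stated endpoints, since R = 0 gives SSTF and R = 1 gives SCAN. All function and parameter names are hypothetical.

```python
def effective_distance(head, request, direction, r, num_cylinders):
    """Effective distance of a request under V(R).

    direction is +1 when the head is moving toward higher cylinders
    and -1 when it is moving toward lower cylinders.
    """
    physical = abs(request - head)
    # A request in the current direction (or at the head) costs its
    # physical distance only.
    if (request - head) * direction >= 0:
        return physical
    # ASSUMPTION (not stated in the truncated abstract): a request in the
    # opposite direction pays an extra R * num_cylinders penalty.
    return physical + r * num_cylinders


def service_order(head, pending, direction, r, num_cylinders):
    """Return the order in which V(R) would service the pending requests."""
    pending = list(pending)
    order = []
    while pending:
        chosen = min(
            pending,
            key=lambda c: effective_distance(head, c, direction, r, num_cylinders),
        )
        # The SCAN direction follows the head movement just made.
        if chosen != head:
            direction = 1 if chosen > head else -1
        pending.remove(chosen)
        order.append(chosen)
        head = chosen
    return order
```

With the head at cylinder 50 moving upward on a 100-cylinder disk and requests at 45, 60 and 90, R = 0 services the nearest request first regardless of direction, while R = 1 finishes the upward sweep before reversing.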

19 Distributed visualization of graph algorithms 
Alexander A. Sherstov 

January 2003 ACM SIGCSE Bulletin, Proceedings of the 34th SIGCSE technical symposium on Computer science education SIGCSE '03, Volume 35 Issue 1
Publisher: ACM Press

Full text available: pdf(218.85 KB) Additional Information: full citation, abstract, references, index terms

DisViz is a visualization tool designed to assist students in learning graph algorithms, an 
important topic in the undergraduate curriculum. DisViz is intended for collaborative use 
by a group of students over a classroom network. This visualization system views network 
hosts as graph nodes and the socket connections among them, as graph edges. In typical 
usage, every student runs a copy of DisViz on his/her local machine. These applications 
detect each other's presence on the network and coordi ...

Keywords: computer science education, distributed computing, graph algorithms, 
simulation and modeling, visualization 



20 Research sessions: query processing II: Minimal probing: supporting expensive predicates for top-k queries
Kevin Chen-Chuan Chang, Seung-won Hwang
June 2002 Proceedings of the 2002 ACM SIGMOD international conference on Management of data SIGMOD '02
Publisher: ACM Press

This paper addresses the problem of evaluating ranked top-k queries with expensive
predicates. As major DBMSs now all support expensive user-defined predicates for 
Boolean queries, we believe such support for ranked queries will be even more important: 
First, ranked queries often need to model user-specific concepts of preference, relevance,
or similarity, which call for dynamic user-defined functions. Second, middleware systems 
must incorporate external predicates for integrating autonomo ...